rank | frequency | n-gram |
---|---|---|
1 | 119286 | -r |
2 | 91631 | -a |
3 | 83629 | -i |
4 | 79837 | -n |
5 | 61546 | -m |
rank | frequency | n-gram |
---|---|---|
1 | 56013 | -um |
2 | 47823 | -ar |
3 | 35240 | -ur |
4 | 27943 | -nn |
5 | 26451 | -na |
rank | frequency | n-gram |
---|---|---|
1 | 21470 | -num |
2 | 20633 | -inn |
3 | 16023 | -nar |
4 | 11236 | -nni |
5 | 9568 | -inu |
rank | frequency | n-gram |
---|---|---|
1 | 8033 | -rinn |
2 | 7882 | -unum |
3 | 6576 | -sins |
4 | 5880 | -anna |
5 | 5639 | -nnar |
rank | frequency | n-gram |
---|---|---|
1 | 5807 | -urinn |
2 | 4813 | -ingar |
3 | 3367 | -arnir |
4 | 3147 | -innar |
5 | 2472 | -urnar |
The tables show the most frequent letter-N-grams at the ending of words for N=1…5. Everything runs in parallel to 2.2.5 Most frequent word beginnings. The aim is suffix detection instead of affix detection.
For N=3:
SELECT @pos:=(@pos+1), xx.* from (SELECT @pos:=0) r, (select count(*) as cnt ,concat("-", right(word,3)) FROM words WHERE w_id>100 group by right(word,3) order by cnt desc) xx limit 5;
2.2.5 Most frequent word beginnings